2,474 research outputs found

    A hierarchic task-based programming model for distributed heterogeneous computing

    Get PDF
    Distributed computing platforms are evolving to heterogeneous ecosystems with Clusters, Grids and Clouds introducing in its computing nodes, processors with different core architectures, accelerators (i.e. GPUs, FPGAs), as well as different memories and storage devices in order to achieve better performance with lower energy consumption. As a consequence of this heterogeneity, programming applications for these distributed heterogeneous platforms becomes a complex task. Additionally to the complexity of developing an application for distributed platforms, developers must also deal now with the complexity of the different computing devices inside the node. In this article, we present a programming model that aims to facilitate the development and execution of applications in current and future distributed heterogeneous parallel architectures. This programming model is based on the hierarchical composition of the COMP Superscalar and Omp Superscalar programming models that allow developers to implement infrastructure-agnostic applications. The underlying runtime enables applications to adapt to the infrastructure without the need of maintaining different versions of the code. Our programming model proposal has been evaluated on real platforms, in terms of heterogeneous resource usage, performance and adaptation.This work has been supported by the European Commission through the Horizon 2020 Research and Innovation program under contract 687584 (TANGO project) by the Spanish Government under contract TIN2015-65316 and grant SEV-2015-0493 (Severo Ochoa Program) and by Generalitat de Catalunya under contracts 2014-SGR-1051 and 2014-SGR-1272.Peer ReviewedPostprint (author's final draft

    Fast multipole method applied to 3D frequency domain elastodynamics

    Get PDF
    This article is concerned with the formulation and implementation of a fast multipole-accelerated BEM for 3-D elastodynamics in the frequency domain, based on the so-called diagonal form for the expansion of the elastodynamic fundamental solution, a multi-level strategy. As usual with the FM-BEM, the linear system of BEM equations is solved by GMRES, and the matrix is never explicitly formed. The truncation parameter in the multipole expansion is adjusted to the level, a feature known from recent published studies for the Maxwell equations. A preconditioning strategy based on the concept of sparse approximate inverse (SPAI) is presented and implemented. The proposed formulation is assessed on numerical examples involving O(105)O(10^{5}) BEM unknowns, which show in particular that, as expected, the proposed FM-BEM is much faster than the traditional BEM, and that the GMRES iteration count is significantly reduced when the SPAI preconditioner is used

    Unbalanced sediment budgets in the catchment-alluvial fan system of the Kuitun River (northern Tian Shan, China): Implications for mass-balance estimates, denudation and sedimentation rates in orogenic systems

    Get PDF
    International audienceMass balances are often used to calculate sediment fluxes in foreland basins and denudation rates in adjacent mountain ranges on intermediate to long timescales (from a few tens of thousand to a million years). Here, we study the simple Quaternary catchment-alluvial fan system of the Kuitun River, in northern Tian Shan, to discuss some ideas about sediment storage, release, and bypass in relatively short (100 km long) sediment routing systems. This study shows that the Kuitun catchment and piedmont areas clearly present evidence of a significant and temporary storage of sediments during the Pleistocene. These sediments were then excavated and delivered farther into the foreland basin during the Holocene. The difference between the volumes of materials released from the catchment and piedmont areas (5.5 ± 1.7 km3) and the volume stored in a contemporaneous fan downstream (2.6 ± 0.6 km3) indicates that the latter did not trap the whole sediment load transported by the river. The alluvial fan was bypassed by 27 to 78% of this load toward its distal alluvial plain. If this value is well estimated, it implies a major volumetric partitioning of the deposits between the fan and the alluvial plain, with a very high sedimentation rate in the fan (1.97 ± 0.52 mm*y− 1) and a much lower one downstream (0.11 ± 0.11 mm*y− 1). However, this volumetric partitioning might only occur during periods with a very specific hydrological regime such as the Holocene deglaciation. Eventually, the peculiar sediment storage and release pattern within the Kuitun catchment and piedmont areas during the Pleistocene and Holocene complicates the calculation of mean paleodenudation rates using either sediment budgets or in situ produced cosmogenic nuclides

    Predicting Future Space Near-IR Grism Surveys using the WFC3 Infrared Spectroscopic Parallels Survey

    Get PDF
    We present near-infrared emission line counts and luminosity functions from the HST WFC3 Infrared Spectroscopic Parallels (WISP) program for 29 fields (0.037 deg^2) observed using both the G102 and G141 grisms. Altogether we identify 1048 emission line galaxies with observed equivalent widths greater than 40 Angstroms, 467 of which have multiple detected emission lines. The WISP survey is sensitive to fainter flux levels (3-5x10^-17 ergs/s/cm^2) than the future space near-infrared grism missions aimed at baryonic acoustic oscillation cosmology (1-4x10^-16 ergs/s/cm^2), allowing us to probe the fainter emission line galaxies that the shallower future surveys may miss. Cumulative number counts of 0.7<z<1.5 galaxies reach 10,000 deg^-2 above an H-alpha flux of 2x10^-16 ergs/s/cm^2. H-alpha-emitting galaxies with comparable [OIII] flux are roughly 5 times less common than galaxies with just H-alpha emission at those flux levels. Galaxies with low H-alpha/[OIII] ratios are very rare at the brighter fluxes that future near-infrared grism surveys will probe; our survey finds no galaxies with H-alpha/[OIII] < 0.95 that have H-alpha flux greater than 3x10^-16 ergs/s/cm^2. Our H-alpha luminosity function contains a comparable number density of faint line emitters to that found by the NICMOS near-infrared grism surveys, but significantly fewer (factors of 3-4 less) high luminosity emitters. We also find that our high redshift (z=0.9-1.5) counts are in agreement with the high redshift (z=1.47) narrow band H-alpha survey of HiZELS (Sobral et al. 2013), while our lower redshift luminosity function (z=0.3-0.9) falls slightly below their z=0.84 result. The evolution in both the H-alpha luminosity function from z=0.3--1.5 and the [OIII] luminosity function from z=0.7-2.3 is almost entirely in the L* parameter, which steadily increases with redshift over those ranges.Comment: 18 pages, 14 figures, Accepted by Ap

    Mapping Orthorhombic Domains with Geometrical Phase Analysis in Rare-Earth Nickelate Heterostructures

    Full text link
    Most perovskite oxides belong to the Pbnm space group, composed by an anisotropic unit cell, A-site antipolar displacements and oxygen octahedral tilts. Mapping the orientation of the orthorhombic unit cell in epitaxial heterostructures that consist of at least one Pbnm compound is often required to understand and control the different degrees of coupling established at their coherent interfaces and, therefore, their resulting physical properties. However, retrieving this information from the strain maps generated with high-resolution scanning transmission electron microscopy can be challenging, because the three pseudocubic lattice parameters are very similar in these systems. Here, we present a novel methodology for mapping the crystallographic orientation in Pbnm systems. It makes use of the geometrical phase analysis algorithm, as applied to aberration-corrected scanning transition electron microscopy images, but in an unconventional way. The method is fast and robust, giving real-space maps of the lattice orientations in Pbnm systems, from both cross-sectional and plan-view geometries and across large fields of view. As an example, we apply our methodology to rare-earth nickelate heterostructures, in order to investigate how the crystallographic orientation of these films depends on various structural constraints that are imposed by the underlying single crystal substrates. We observe that the resulting domain distributions and associated defect landscapes mainly depend on a competition between the epitaxial compressive/tensile and shear strains, together with the matching of atomic displacements at the substrate/film interface. The results point towards strategies for controlling these characteristics by appropriate substrate choice.Comment: 32 pages, 5 figures, 2 table

    Physical Properties of Emission-Line Galaxies at z ~ 2 from Near-Infrared Spectroscopy with Magellan FIRE

    Get PDF
    We present results from near-infrared spectroscopy of 26 emission-line galaxies at z ~ 2 obtained with the FIRE spectrometer on the Magellan Baade telescope. The sample was selected from the WISP survey, which uses the near-infrared grism of the Hubble Space Telescope Wide Field Camera 3 to detect emission-line galaxies over 0.3 < z < 2.3. Our FIRE follow-up spectroscopy (R~5000) over 1.0-2.5 micron permits detailed measurements of physical properties of the z~2 emission-line galaxies. Dust-corrected star formation rates for the sample range from ~5-100 M_sun yr-1. We derive a median metallicity for the sample of ~0.45 Z_sun, and the estimated stellar masses range from ~10^8.5 - 10^9.5 M_sun. The average ionization parameters measured for the sample are typically much higher than what is found for local star-forming galaxies. We derive composite spectra from the FIRE sample, from which we infer typical nebular electron densities of ~100-400 cm^-3. Based on the location of the galaxies and composite spectra on BPT diagrams, we do not find evidence for significant AGN activity in the sample. Most of the galaxies as well as the composites are offset in the BPT diagram toward higher [O III]/H-beta at a given [N II]/H-alpha, in agreement with other observations of z > 1 star-forming galaxies, but composite spectra derived from the sample do not show an appreciable offset from the local star-forming sequence on the [O III]/H-beta versus [S II]/H-alpha diagram. We infer a high nitrogen-to-oxygen abundance ratio from the composite spectrum, which may contribute to the offset of the high-redshift galaxies from the local star-forming sequence in the [O III]/H-beta versus [N II]/H-alpha diagram. We speculate that the elevated nitrogen abundance could result from substantial numbers of Wolf-Rayet stars in starbursting galaxies at z~2. (Abridged)Comment: Accepted for publication in Ap

    Deep crustal faults and the origin and long-term flank stability of Mt. Etna : first results from the CIRCEE cruise (Oct. 2013)

    Get PDF
    The relation between deep crustal faults and the origin of Mount Etna, the largest and most active volcano in Europe has long been suspected due to its unusual geodynamic location. Results from a new marine geophysical survey offshore Eastern Sicily reveal the detailed geometry (location, length, dip and orientation) of a two-branched 200-km long, lithospheric scale fault system, long sought for as being the cause of Mount Etna. Using high-resolution bathymetry and seismic profiling, we image a 60-km long, previously unidentified, NW trending fault with evidence of recent displacement at the seafloor, offsetting Holocene sediments. This newly identified fault connects NE of Catania, to a known 40-km long, offshore-onshore fault system dissecting the southeastern flank of Mount Etna, generally interpreted as purely gravitational collapse structures.peer-reviewe

    Groupe de travail COFEND sur la simulation des END par courants de Foucault – Bilan d'activités

    Get PDF
    National audienceAs an answer to one of the main concerns expressed by the attendees of the first COFREND technical meeting on eddy current NDE simulation held on October 19, 2004, the "eddy current modelling" COFREND Working Group has oriented its activities towards the definition and resolution of benchmark problems, starting from the beginning of 2005.The aim of the Working Group is to define benchmark problems representative of actual industrial issues and uncovered by existing benchmarks. The proposed problems must, as much as possible, be simple (an exact description of the actual industrial problem is not advised) and have complete and accessible descriptive and experimental data. They are meant to be granted a public access and to be open to a resolution by any applicant code.At the present time, the Working Group has defined 4 benchmarks inspired by issues from aerospace and nuclear industries, as well as from laboratory trials, and is studying other proposals. The first resolution examples are presented for two of them, implying both finite element and semi-analytic codes.En réponse à l'une des préoccupations exprimées par les participants à la première journée technique de la COFREND sur la simulation des END par courants de Foucault, le 19 Octobre 2004, le groupe de travail COFREND "modélisation courants de Foucault" a orienté ses activités vers la définition et la résolution de cas-tests (ou "benchmarking") à partir de début 2005.L'objectif du groupe de travail est de définir des cas-tests représentatifs de problématiques industrielles non couvertes par les cas-tests connus de la bibliographie. Les problèmes proposés doivent autant que possible être relativement simples (la description exacte du problème industriel n'est pas recommandée) et disposer de données descriptives et expérimentales complètes et accessibles. Ils ont pour but d'être rendus publics et ouverts à la résolution de tout code de calcul candidat.Actuellement, le groupe de travail a défini 4 cas-tests inspirés de problématiques issues des industries aéronautiques et nucléaires, ainsi que d'essais de laboratoire, et il étudie d'autres propositions. Des premiers exemples de résolution sont présentés pour deux d'entre eux, en utilisant des codes faisant appel aux éléments finis et aux méthodes semi-analytiques

    Reusable Verification Environment for a RISC-V Vector Accelerator

    Get PDF
    This paper presents a reusable verification environment developed for the verification of an academic RISC-V based vector accelerator that operates with long vectors. In order to be used across diverse projects, this infrastructure intends to be independent of the interface used for connecting the accelerator to the scalar processor core. We built a verification infrastructure consisting of a Universal Verification Environment (UVM) which is capable of validating the design performing co-simulation of the vector instructions. Moreover, we provided a set of tests and an automated test generation, simulation and error reporting infrastructure. This paper shares our experience on verifying a complex accelerator used in two distinct projects, with different interfaces.This research has received funding from the European High Performance Computing Joint Undertaking (JU) under Framework Partnership Agreement No 800928 (European Processor Initiative) and Specific Grant Agreement No 101036168 (EPI SGA2) and No 956702 (eProcessor) . The JU receives support from the European Union’s Horizon 2020 research and innovation programme and from Croatia, France, Germany, Greece, Italy, Netherlands, Portugal, Spain, Sweden, and Switzerland. The EPI-SGA2 project, PCI2022-132935_N1618737 is also co-funded by MCIN/AEI /10.13039/501100011033 and by the UE NextGenerationEU/PRTRPeer ReviewedPostprint (author's final draft
    • …
    corecore